Efficiency of different measures for defining the applicability domain of classification models

Authors

  • Waldemar Klingspohn
  • Miriam Mathea
  • Antonius Ter Laak
  • Nikolaus Heinrich
  • Knut Baumann

Abstract

The goal of defining an applicability domain for a predictive classification model is to identify the region in chemical space where the model's predictions are reliable. The boundary of the applicability domain is defined with the help of a measure that is meant to reflect the reliability of an individual prediction. Here, the available measures are divided into those that flag unusual objects independently of the original classifier and those that use information from the trained classifier. The former set of techniques is referred to as novelty detection, while the latter is designated as confidence estimation. A review of the available confidence estimators shows that most of these measures estimate the probability of class membership of the predicted objects, which is inversely related to the error probability. Thus, class probability estimates are natural candidates for defining the applicability domain but were not comprehensively included in previous benchmark studies. The focus of the present study is to find the best measure for defining the applicability domain for a given binary classification technique and to compare the performance of novelty detection with that of confidence estimation. Six different binary classification techniques in combination with ten data sets were studied to benchmark the various measures. The area under the receiver operating characteristic curve (AUC ROC) was employed as the main benchmark criterion. It is shown that class probability estimates consistently perform best at differentiating between reliable and unreliable predictions. Previously proposed alternatives to class probability estimates do not perform better than the latter and are inferior in most cases. Interestingly, the impact of defining an applicability domain depends on the observed AUC ROC of the classifier. That is, it depends on the level of difficulty of the classification problem (expressed as AUC ROC) and is largest for intermediately difficult problems (AUC ROC in the range 0.7-0.9). In the ranking of classifiers, classification random forests performed best on average. Hence, classification random forests in combination with the respective class probability estimate are a good starting point for predictive binary chemoinformatic classifiers with an applicability domain.
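
As an illustration of the idea in the abstract, the following minimal Python sketch scores a confidence measure by how well it separates correct from incorrect predictions, using AUC ROC. It is one plausible reading of the benchmark criterion, not the authors' implementation. The function names (confidence_from_probability, auc_roc, reliability_auc), the 0.5 decision threshold, and the example numbers are all assumptions made for illustration; they do not come from the paper.

```python
# Hypothetical sketch: rate a confidence measure by its ability to rank
# correct predictions above incorrect ones (AUC ROC over "correct" labels).

def confidence_from_probability(p_positive):
    """Confidence of a binary prediction: probability of the predicted class."""
    return max(p_positive, 1.0 - p_positive)


def auc_roc(scores, labels):
    """AUC ROC computed as the Mann-Whitney statistic.

    Returns the fraction of (label 1, label 0) pairs in which the label-1
    object has the higher score; ties count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one object with each label")
    wins = 0.0
    for sp in pos:
        for sn in neg:
            if sp > sn:
                wins += 1.0
            elif sp == sn:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def reliability_auc(p_positive, y_true, threshold=0.5):
    """AUC ROC of the class probability estimate as a reliability measure."""
    predicted = [1 if p >= threshold else 0 for p in p_positive]
    # Label 1 marks a correct prediction, label 0 an incorrect one.
    correct = [1 if yhat == y else 0 for yhat, y in zip(predicted, y_true)]
    confidence = [confidence_from_probability(p) for p in p_positive]
    return auc_roc(confidence, correct)


# Made-up class probability estimates and true labels (illustration only).
p_positive = [0.95, 0.90, 0.65, 0.70, 0.45, 0.20, 0.05, 0.30]
y_true = [1, 0, 0, 1, 1, 0, 0, 0]

if __name__ == "__main__":
    print(reliability_auc(p_positive, y_true))
```

A value near 1 would mean that the measure ranks almost every correct prediction above every incorrect one, so thresholding it yields a useful applicability domain; a value near 0.5 would mean the measure carries no information about reliability.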


Similar resources

Finding stability regions for preserving efficiency classification of variable returns to scale technology in data envelopment analysis

This paper addresses the issue of the sensitivity of the efficiency classification of variable returns to scale (VRS) technology, with the aim of enhancing the credibility of data envelopment analysis (DEA) results in practical applications when an additional decision making unit (DMU) needs to be added to the set being considered. It also develops a structured approach to assisting practitioners in making an appropria...


Interval network data envelopment analysis model for classification of investment companies in the presence of uncertain data

The main purpose of this paper is to propose an approach for measuring the performance of, classifying and ranking investment companies (ICs) by considering their internal structure and uncertainty. To reach this goal, the interval network data envelopment analysis (INDEA) models are extended. This model is able to model two-stage efficiency with intermediate measures i...


Uniquely classifying flexible measures by a CWS model

Data Envelopment Analysis (DEA) deals with evaluating a set of decision-making units (DMUs) based on nonparametric mathematical approaches. In classical DEA models, the given set of factors of DMUs is divided into two categories, inputs and outputs, while in some practical problems there are some measures whose membership to these categories is unclear. It means these factors treat different in...


Using MODEA and MODM with Different Risk Measures for Portfolio Optimization

The purpose of this study is to develop portfolio optimization and asset allocation using our proposed models. The study is based on a non-parametric efficiency analysis tool, namely Data Envelopment Analysis (DEA). Conventional DEA models assume non-negative data for inputs and outputs. However, many of these data take negative values; therefore we propose the MeanSharp-βRisk (MShβR) model...


Developing Fuzzy Models for Estimating the Quality of VoIP

This paper presents a novel method for modeling the one-way quality prediction of VoIP non-intrusively. Intrusive measures of voice quality suffer from a common deficiency: the need for a reference signal to evaluate the quality of voice. Because of this shortcoming, a great deal of effort has recently been devoted to modeling voice quality prediction non-intrusively according to quality degradat...



Journal title:

Volume 9  Issue 

Pages  -

Publication date 2017